We identify crucial design issues in building a distributed inverted index
for a large collection of Web pages. We introduce a novel pipelining 
technique for structuring the core index-building system that 
substantially reduces the index construction ...

Query languages for retrieval of XML documents allow for conditions
referring both to the content and the structure of documents. In this 
paper, we investigate two different approaches for reducing index space 
of inverted files for XML documents. First, ...

Query-processing costs on large text databases are dominated by the
need to retrieve and scan the inverted list of each query term. Retrieval 
time for inverted lists can be greatly reduced by the use of compression, 
but this adds to the CPU time required. ...

Compression reduces both the size of indexes and the time needed to
evaluate queries. In this paper, we revisit the compression of inverted 
lists of document postings that store the position and frequency of 
indexed terms, considering two approaches ...

Two well-known indexing methods are inverted files and signature files.
We have undertaken a detailed comparison of these two approaches in 
the context of text indexing, paying particular attention to query 
evaluation speed and space requirements. We ... 